智能论文笔记

OpenPneu: Compact platform for pneumatic actuation with multi-channels

Yingjun Tian , Renbo Su , Xilong Wang , Nur Banu Altin , Guoxin Fang , Charlie C. L. Wang

分类：机器人

2022-09-22

本文提出了一个紧凑的系统OpenPneu，以支持软机器人多腔的气动驱动。系统中使用微型泵来生成气流，因此不需要额外的输入，因为需要压缩空气。我们的系统执行模块化设计以提供良好的可扩展性，这已在具有十个空气通道的原型上证明。OpenPNEU的每个空气通道都配备了通货膨胀和通气功能，可提供从正到负的全范围压力供应，最大流速为1.7 L/min。我们的系统内置了对压力的高精度闭环控制，以实现稳定而有效的动态性能。提供了Python中的开源控制接口和API。我们还证明了OpenPneu在三个软机器人系统上的功能，最多10个腔室。

translated by 谷歌翻译

Optimizing out-of-plane stiffness for soft grippers

Renbo Su , Yingjun Tian , Mingle Du , Charlie C. L. Wang

分类：机器人

2022-07-17

在本文中，我们介绍了一个数据驱动的框架，以优化软抓地力的平面外刚度，以实现机械性能，如难以扭动且易于弯曲。在软气动弯曲执行器（SPBA）的设计中证明了该方法的有效性。首先，定义了一个新的目标函数来定量评估平面外刚度以及弯曲性能。然后，对SPBA设计的参数模型进行灵敏度分析，以确定有限元分析（FEA）的优化设计参数。为了启用数值优化的计算，采用数据驱动的方法来学习成本函数，该成本函数直接代表平面外刚度作为设计变量的可区分函数。一种基于梯度的方法用于最大化SPBA的平面外刚度，同时确保特定的弯曲性能。我们方法的有效性已在3D打印的握把上进行的物理实验中得到了证明。

translated by 谷歌翻译

Collision-Aware Fast Simulation for Soft Robots by Optimization-Based Geometric Computing

Guoxin Fang , Yingjun Tian , Andrew Weightman , Charlie C. L. Wang

分类：机器人

2022-03-03

软机器人由于其机械合规性可以安全地与环境相互作用。在现代的软机器人的现代设计中，自我碰撞也用于在不同的任务中提高其性能。但是，开发一个可以很好地处理碰撞响应的高效且可靠的模拟器，仍然是软机器人技术研究中的一项艰巨任务。本文基于几何优化提供了一个碰撞感知的模拟器，其中我们开发了一种高效且逼真的碰撞检查 /响应模型，该模型包含了超弹性材料特性。软机器人的驱动变形和碰撞响应都是基于几何目标的。可以通过最小化基于几何的目标函数来获得软机器人的无碰撞主体。与基于FEA的物理模拟不同，所提出的管道的计算成本要低得多。此外，在处理具有较大体积变化的软机器人时，适用自适应重新捕获以提高收敛性。在不同的软机器人上进行了实验测试，以验证我们的方法的性能。

translated by 谷歌翻译

KoopmanLab: A PyTorch module of Koopman neural operator family for solving partial differential equations

Wei Xiong , Muyuan Ma , Pei Sun , Yang Tian

分类：机器学习

2023-01-03

Given the increasingly intricate forms of partial differential equations (PDEs) in physics and related fields, computationally solving PDEs without analytic solutions inevitably suffers from the trade-off between accuracy and efficiency. Recent advances in neural operators, a kind of mesh-independent neural-network-based PDE solvers, have suggested the dawn of overcoming this challenge. In this emerging direction, Koopman neural operator (KNO) is a representative demonstration and outperforms other state-of-the-art alternatives in terms of accuracy and efficiency. Here we present KoopmanLab, a self-contained and user-friendly PyTorch module of the Koopman neural operator family for solving partial differential equations. Beyond the original version of KNO, we develop multiple new variants of KNO based on different neural network architectures to improve the general applicability of our module. These variants are validated by mesh-independent and long-term prediction experiments implemented on representative PDEs (e.g., the Navier-Stokes equation and the Bateman-Burgers equation) and ERA5 (i.e., one of the largest high-resolution data sets of global-scale climate fields). These demonstrations suggest the potential of KoopmanLab to be considered in diverse applications of partial differential equations.

translated by 谷歌翻译

Towards Modeling and Influencing the Dynamics of Human Learning

Ran Tian , Masayoshi Tomizuka , Anca Dragan , Andrea Bajcsy

分类：机器人 | 人工智能

2023-01-02

Humans have internal models of robots (like their physical capabilities), the world (like what will happen next), and their tasks (like a preferred goal). However, human internal models are not always perfect: for example, it is easy to underestimate a robot's inertia. Nevertheless, these models change and improve over time as humans gather more experience. Interestingly, robot actions influence what this experience is, and therefore influence how people's internal models change. In this work we take a step towards enabling robots to understand the influence they have, leverage it to better assist people, and help human models more quickly align with reality. Our key idea is to model the human's learning as a nonlinear dynamical system which evolves the human's internal model given new observations. We formulate a novel optimization problem to infer the human's learning dynamics from demonstrations that naturally exhibit human learning. We then formalize how robots can influence human learning by embedding the human's learning dynamics model into the robot planning problem. Although our formulations provide concrete problem statements, they are intractable to solve in full generality. We contribute an approximation that sacrifices the complexity of the human internal models we can represent, but enables robots to learn the nonlinear dynamics of these internal models. We evaluate our inference and planning methods in a suite of simulated environments and an in-person user study, where a 7DOF robotic arm teaches participants to be better teleoperators. While influencing human learning remains an open problem, our results demonstrate that this influence is possible and can be helpful in real human-robot interaction.

translated by 谷歌翻译

ReSQueing Parallel and Private Stochastic Convex Optimization

Yair Carmon , Arun Jambulapati , Yujia Jin , Yin Tat Lee , Daogao Liu , Aaron Sidford , Kevin Tian

分类：机器学习 | (统计)机器学习

2023-01-01

We introduce a new tool for stochastic convex optimization (SCO): a Reweighted Stochastic Query (ReSQue) estimator for the gradient of a function convolved with a (Gaussian) probability density. Combining ReSQue with recent advances in ball oracle acceleration [CJJJLST20, ACJJS21], we develop algorithms achieving state-of-the-art complexities for SCO in parallel and private settings. For a SCO objective constrained to the unit ball in $\mathbb{R}^d$, we obtain the following results (up to polylogarithmic factors). We give a parallel algorithm obtaining optimization error $\epsilon_{\text{opt}}$ with $d^{1/3}\epsilon_{\text{opt}}^{-2/3}$ gradient oracle query depth and $d^{1/3}\epsilon_{\text{opt}}^{-2/3} + \epsilon_{\text{opt}}^{-2}$ gradient queries in total, assuming access to a bounded-variance stochastic gradient estimator. For $\epsilon_{\text{opt}} \in [d^{-1}, d^{-1/4}]$, our algorithm matches the state-of-the-art oracle depth of [BJLLS19] while maintaining the optimal total work of stochastic gradient descent. We give an $(\epsilon_{\text{dp}}, \delta)$-differentially private algorithm which, given $n$ samples of Lipschitz loss functions, obtains near-optimal optimization error and makes $\min(n, n^2\epsilon_{\text{dp}}^2 d^{-1}) + \min(n^{4/3}\epsilon_{\text{dp}}^{1/3}, (nd)^{2/3}\epsilon_{\text{dp}}^{-1})$ queries to the gradients of these functions. In the regime $d \le n \epsilon_{\text{dp}}^{2}$, where privacy comes at no cost in terms of the optimal loss up to constants, our algorithm uses $n + (nd)^{2/3}\epsilon_{\text{dp}}^{-1}$ queries and improves recent advancements of [KLL21, AFKT21]. In the moderately low-dimensional setting $d \le \sqrt n \epsilon_{\text{dp}}^{3/2}$, our query complexity is near-linear.

translated by 谷歌翻译

Can Direct Latent Model Learning Solve Linear Quadratic Gaussian Control?

Yi Tian , Kaiqing Zhang , Russ Tedrake , Suvrit Sra

分类：机器学习 | (统计)机器学习

2022-12-30

We study the task of learning state representations from potentially high-dimensional observations, with the goal of controlling an unknown partially observable system. We pursue a direct latent model learning approach, where a dynamic model in some latent state space is learned by predicting quantities directly related to planning (e.g., costs) without reconstructing the observations. In particular, we focus on an intuitive cost-driven state representation learning method for solving Linear Quadratic Gaussian (LQG) control, one of the most fundamental partially observable control problems. As our main results, we establish finite-sample guarantees of finding a near-optimal state representation function and a near-optimal controller using the directly learned latent model. To the best of our knowledge, despite various empirical successes, prior to this work it was unclear if such a cost-driven latent model learner enjoys finite-sample guarantees. Our work underscores the value of predicting multi-step costs, an idea that is key to our theory, and notably also an idea that is known to be empirically valuable for learning state representations.

translated by 谷歌翻译

A Class-wise Non-salient Region Generalized Framework for Video Semantic Segmentation

Yuhang Zhang , Shishun Tian , Muxin Liao , Zhengyu Zhang , Wenbin Zou , Chen Xu

分类：计算机视觉

2022-12-29

Video semantic segmentation (VSS) is beneficial for dealing with dynamic scenes due to the continuous property of the real-world environment. On the one hand, some methods alleviate the predicted inconsistent problem between continuous frames. On the other hand, other methods employ the previous frame as the prior information to assist in segmenting the current frame. Although the previous methods achieve superior performances on the independent and identically distributed (i.i.d) data, they can not generalize well on other unseen domains. Thus, we explore a new task, the video generalizable semantic segmentation (VGSS) task that considers both continuous frames and domain generalization. In this paper, we propose a class-wise non-salient region generalized (CNSG) framework for the VGSS task. Concretely, we first define the class-wise non-salient feature, which describes features of the class-wise non-salient region that carry more generalizable information. Then, we propose a class-wise non-salient feature reasoning strategy to select and enhance the most generalized channels adaptively. Finally, we propose an inter-frame non-salient centroid alignment loss to alleviate the predicted inconsistent problem in the VGSS task. We also extend our video-based framework to the image-based generalizable semantic segmentation (IGSS) task. Experiments demonstrate that our CNSG framework yields significant improvement in the VGSS and IGSS tasks.

translated by 谷歌翻译

Parsing Objects at a Finer Granularity: A Survey

Yifan Zhao , Jia Li , Yonghong Tian

分类：计算机视觉

2022-12-28

Fine-grained visual parsing, including fine-grained part segmentation and fine-grained object recognition, has attracted considerable critical attention due to its importance in many real-world applications, e.g., agriculture, remote sensing, and space technologies. Predominant research efforts tackle these fine-grained sub-tasks following different paradigms, while the inherent relations between these tasks are neglected. Moreover, given most of the research remains fragmented, we conduct an in-depth study of the advanced work from a new perspective of learning the part relationship. In this perspective, we first consolidate recent research and benchmark syntheses with new taxonomies. Based on this consolidation, we revisit the universal challenges in fine-grained part segmentation and recognition tasks and propose new solutions by part relationship learning for these important challenges. Furthermore, we conclude several promising lines of research in fine-grained visual parsing for future research.

translated by 谷歌翻译

Part-guided Relational Transformers for Fine-grained Visual Recognition

Yifan Zhao , Jia Li , Xiaowu Chen , Yonghong Tian

分类：计算机视觉

2022-12-28

Fine-grained visual recognition is to classify objects with visually similar appearances into subcategories, which has made great progress with the development of deep CNNs. However, handling subtle differences between different subcategories still remains a challenge. In this paper, we propose to solve this issue in one unified framework from two aspects, i.e., constructing feature-level interrelationships, and capturing part-level discriminative features. This framework, namely PArt-guided Relational Transformers (PART), is proposed to learn the discriminative part features with an automatic part discovery module, and to explore the intrinsic correlations with a feature transformation module by adapting the Transformer models from the field of natural language processing. The part discovery module efficiently discovers the discriminative regions which are highly-corresponded to the gradient descent procedure. Then the second feature transformation module builds correlations within the global embedding and multiple part embedding, enhancing spatial interactions among semantic pixels. Moreover, our proposed approach does not rely on additional part branches in the inference time and reaches state-of-the-art performance on 3 widely-used fine-grained object recognition benchmarks. Experimental results and explainable visualizations demonstrate the effectiveness of our proposed approach. The code can be found at https://github.com/iCVTEAM/PART.

translated by 谷歌翻译